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^ : Abstract 



The Degrees of Freedom (DoF) of a K-User MISO Broadcast Channel (BC) is studied when the 
Transmitter (TX) has access to a delayed channel estimate in addition to an imperfect estimate of the 
CO I current channel. The current estimate could be for example obtained from prediction applied on past 

■ estimates, in the case where feedback delay is within the coherence time. Building on previous recent 

works on this setting with two users, the estimation error of the current channel is characterized by 
CO ' its scaling as P^" where a = 1 (resp. a — 0) corresponds to an estimate being essentially perfect 

(resp. useless) in terms of DoF. In this work, we contribute to the characterization of the DoF region 
in such a setting by deriving an outerbound for the DoF region and by providing an achievable DoF 



H ' region. The achievable DoF is obtained by developing a new alignment scheme, called the Kq,-MAT 

■ 

scheme, which builds upon both the principle of the MAT alignment scheme from Maddah-Ali and 
Tse and Zero-Forcing to achieve a larger DoF when the delayed CSIT received is correlated with the 
instantaneous channel state. 



I. Introduction 

The use of multiple- antenna has been recognized during the last decade as a key element to 
improve performance in wireless networks due to the possibility to achieve a larger number of 
Degrees-of-Freedom (DoF), or pre-log factor, by transmitting several independent data streams 
at the same time dij. While in point-to-point MIMO systems, the maximal DoF can be achieved 
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without Channel State Information (CSI) at the Transmitter (TX), the exploitation of the multiple- 
antennas at the TX to achieve a DoF larger than one in multiuser settings heavily relies on the 
availability of accurate-enough CSI at the TX (CSIT). For instance, it is well known that in 
the A^-user Multiple-Input Single Output (MISO) Broadcast Channel (BC), the DoF is reduced 
from 7^ to 1 in the absence of CSIT [2] while full DoF is preserved if the variance of the 
channel estimation error falls as P^^ or faster, where P is the Signal-to-Noise Ratio (SNR) [3J, 
JH. Similar conclusions have been obtained in more general settings [|5]|, [l6]|. 

Yet, the obtaining of an accurate-enough CSIT represents a challenge in many settings. Indeed, 
the channel estimate has to be fed back from the RXs which inevitably introduces some delays 
and some degradations. Therefore, a large literature has focused on the problem of designing 
efficient feedback schemes and evaluating the impact of imperfect CSIT [See [|3l, [|71 and 
reference therein]. 

Recently, a new line of work was opened by the work from Maddah-Ali and Tse |[8l, 
Studying a J^-user MISO BC, they showed that even completely outdated CSIT, in the sense that 
the feedback delay exceeds the coherence period of the channel, could still be used to achieve a 
larger DoF than in the absence of CSIT. This is accomplished through a space-time alignment 
of the interference referred in the literature as the MAT alignment. Furthermore, if the channel 
matrices are independent and identically distributed over time and across the Receivers (RXs), 
the MAT scheme is then optimal in terms of DoF. 

This new method of exploiting stale CSIT has attracted a large interest and has been extended 
to further network scenarios. In [[TOl. [fTTI . the approach is adapted to two-user and three-user 
settings with multiple-antenna at the RXs, and to Interference Channels (ICs) and X-channels 
in [fT2ll - [fT5l . among others. In [fT6l . the IC with TXs having unequal CSIT is also investigated. 

Going beyond completely outdated CSIT, settings with CSIT of alternating qualities have been 
investigated. In [ITTll . a setting is studied in a block fading model where the CSIT is only accurate 
for some time slots and completely outdated during others. It is then shown that under some 
conditions the maximal DoF can still be achieved. Considering a more general CSIT model, 
the two-user MISO BC is studied in [[TBI in the case where the CSIT relative to one user is 
altematively perfect, completely outdated, or non-existent. It is then shown that the alternating 
between different CSIT configurations can lead to synergistic benefits. 

Yet, a major restriction of these works is that they all consider the delayed CSIT as being 
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completely uncorrelated with the instantaneous channel state. This assumption is lifted in fT9l 
where an improved DoF is shown to be achievable in the case where the delayed CSIT is assumed 
to be possibly correlated with the current channel state. As a consequence, an imperfect estimate 
of the current channel can be obtained by prediction based on the delayed CSIT. Specifically, it 
is assumed that the channel estimation error resulting from the prediction based on the delayed 
CSIT scales as P^" with a > being the CSIT quality exponent. Thus, when a is equal to 
one, the imperfect estimate of the current channel is essentially perfect in terms of DoF. On the 
opposite when a tends to zero, the estimate of the current channel is essentially useless. 

Building on the approach developed in [fT9l . the scheme was improved to reach the maximal 
DoF in a two-user MISO scenario [|20l . [|2TI . The scheme achieving the optimal DoF region in 
the two-user MISO BC is referred hereafter as the a- MAT scheme. This approach has then been 
extended to imperfect delayed CSIT in [l22l, [|2l and to two-user MIMO BC and IC in [24J. 
The study of delayed CSIT correlated to the instantaneous channel state has always remained 
restricted to the two-user case and the results do not trivially extend to more users. Finding 
the DoF region and extending the ct-MAT alignment to more users is precisely the goal of this 
work. 

Specifically, our main contributions are as follows. 

• As a preliminary step, we develop a new alignment scheme, called the A-MAT scheme, to 
exploit completely outdated CSIT. This scheme can be seen as an extension of the alternative 
version of MAT for the two-user case and is more adapted to the combined use of ZF and 
alignment based on delayed CSIT. Yet, its performances are suboptimal. 

• We derive an outerbound for the fC-user MISO broadcast channel with delayed CSIT and 
imperfect current CSIT with quality exponent a. 

• We develop a new scheme which combines the A-MAT alignment scheme and Zero-Forcing 
(ZF) in such a way that the sum DoF takes the simple form (1 - a) DoF'^"'^^'^ +aDoF^^, 
where DoF^"^^'^ and DoF^^ are the sum DoF achieved respectively with the A-MAT 
scheme and with ZF. 

Notations: The complex circularly invariant Gaussian distribution of mean fi and variance 
is denoted by A/'c(0, a^). fix) ~ gix) denotes the fact that lim^._^oo 44 = C with C 7^ 0. The 
jth element of the ith row of the matrix A is denoted by {Ajjj. The function log represents the 
logarithm with base 2 and || A||f the Frobenius norm of the matrix A. A ^ is used to represent 
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the fact that the matrix A is positive semidefinite while A ^ B denotes the that A — B ^ 0. 
If A is a positive definite matrix, A^/^ denotes the unique lower triangular matrix with strictly 
positive coefficient obtained via the Cholesky factorization such that A = A^/^(A^/^)^. We 
write wlog for without loss of generality and i.i.d. for independently and identically distributed. 

II. System Model 

A. K-User MISO Broadcast Channel 

This work considers a /C-User MISO BC where the TX is equipped with M antennas and 
serves K single-antenna users. We assume furthermore that M > K. Ai any time t, the signal 
received at RX i can be written as 

y,{t) = hf{t)x{t) + z,{t) (1) 

where hf E ^ixm (.j^g channel to user i at time t, x E C^^^^ is the transmitted signal, and 
Zi(t) G C is the additive noise at RX i, independent of the channel and the transmitted signal 
and distributed as A/c(0, 1). Furthermore, the transmitted signal x{t) fulfills the average power 
constraint E[||a;(t)|p] < P. 

We define further the channel matrix H = [^i, . . . , hx]^ E C^^'^^ and introduce the nota- 
tion = {H(A;)}^^* . The channel is assumed to be drawn from a continuous ergodic distribution 
such that all the channel matrices and all their submatrices are full rank. 

B. Delayed CSIT with Correlation in Time 

The considered CSIT model builds on the delayed CSIT model introduced in [8] and gen- 
eralized to account for time correlation in [19|. According to this model, the TX has access 
at time t to the delayed CSI. It takes the form of the CSI up to time t — 1 which is denoted 
by ^ Furthermore, exploiting the correlation in time between the delayed CSI and the 
current channel state H(t), the TX produces an imperfect estimate of the channel state denoted 
by H(t). This channel estimate is then modeled such that 

H(t) = H(t) + H(t) (2) 

where the channel estimate and the channel estimation error are independent, the channel estima- 
tion error H(t) has its elements i.i.d. A/'c(0, cx^) while the elements of the channel estimate H(t) 



are assumed to have a variance equal to 1 - a^. We further define "H* = {Il{k)}lz\ and 

It is also assumed that the channel state H(t) is independent of the pair (W^^ when 
conditioned on H(t). 

The variance of the estimation error is parameterized as a function of the SNR P such 
that (T^ = P^" where we have defined the CSIT quality exponent a as 

« = lim -j — 7^^. (3) 

P^oo log(P) 

Note that from a DoF perspective, we can restrict ourselves to a G [0, 1] since an estima- 
tion/quantization error scaling as P^^ is essentially perfect while an estimation error scaling 
as P° is essentially useless in terms of DoF. 

Remark: This suggests that in order to keep the rate scaling in the SNR, and under a given 
time-correlation model, the feedback delay as a fraction of the correlation time must shrink as 
the SNR increases (e.g., the terminal velocity must decrease). 

Note furthermore that for any ZF precoded vector u such that hfu = 0, it can easily be 
shown that E[|/ifMp] ~ p-"E[||'u||2]. 

Following the conventional assumption from the literature of delayed CSIT (e.g., in [9J), all 
the RXs are assumed to receive with a certain delay both the perfect multiuser CSI and the 
imperfect CSI. This CSI is used only for the RX to decode its data symbols such that the only 
limitation for this delay lies in the delay requirement of the data transmitted. The CSI at the RX 
side could for example be obtained if each user broadcasts is CSI implying that the other RXs 
can obtain the same CSI as the TX. Another solution is to simply let the TX send its perfect 
delayed CSIT to all the RXs [|25l. 

C. Degrees-of-Freedom Analysis 

Albeit an incomplete measure of system performance, the DoF offers the unique advantage 
of allowing for analytical tractability for even complex network models and feedback scenarios 
such as this one. Let us denote by V* the DoF-region, which is defined as follows. 

V*^[ (di, d2, . . . , ci^) |3(Pi(P), . . . , Rk{P) e C{P) , s.t. = 1,. . ., K, = lim ] 

[ P^oo log(P) J 

(4) 
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where C{P) is the capacity region. Furthermore, the maximal sum DoF will also be of particular 
interest in this work. We denote it by DoF* and define it such that 



K 

DoF* = max } di. (5) 

{di,...,dK)(^V* ^ 
1=1 

III. Main Results 
We provide in this section our main results. 

A. Outerbound 

We start by describing an outerbound for the DoF region, which will then be proven in 
Section |Vll 

Theorem 1. In the K-user MISO BC with perfect delayed CSIT and current CSIT with quality 
exponent a, the DoF region V* is outerbounded by P^ut jgyj^^^j ]yy 

p J _ p 

k ~ ^ k 



k=l k=2 



\fi e {1,...,K}, 0<di<l. (7) 

where Sp is the symmetric group containing all the permutations of {1, . . . ,p}. In turn, the sum 
DoF is upperbounded by DoF^^* defined as 



DoF«- = - ; - . (8) 

2^k=l k 



Proof: The detailed proof is provided in Section |Vl] ■ 
It can be seen that this bound subsumes several known outerbounds from the literature. For a = 
0, it coincides with the optimal DoF achieved by the MAT algorithm while for a = 1, the DoF 
in a MISO BC with perfect CSIT is obtained. Finally, for K = 2, this outerbound simplifies to 
the optimal DoF region provided in [20]. 
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B. Achievable DoF 

The problem of constructing a scheme achieving the outerbound in Theorem [T] is very intricate 
and remains open. This is due to the difficulty to combine ZF (which is optimal for a = 1) 
with the MAT scheme (optimal for a = 0). The scheme for the two-user case developed in [20], 
[EH avoids this problem by using an alternative version of the MAT scheme developped by 
Maddah-Ali and Tse in [8J. In contrast with the original MAT scheme, this alternative version 
can be nicely combined with ZF such that the optimal DoF could then be achieved [[20|. [[2T]|. 
This alternative version does not seem applicable for more than two users. As a consequence, our 
first step has been to find a new alignment scheme based on completely outdated CSIT, which, 
to some extent, generalizes the alternative MAT version to the case of more users. This scheme, 
denoted hereafter as the A-MAT scheme, is described in Section |IV] and shown to achieve the 
following DoF. 

Theorem 2. In the K-user MISO BC with completely outdated CSIT (a = 0), the A-MAT 
scheme achieves a sum DoF equal to 

2K I / 2K-3 



where the number nxs of time slots over which the A-MAT scheme is spread is 

K(K-l) K(K-l) K^(K + 1) 
riTS = ^ + ^-^ ^ + ^ - K{K - 1). (10) 

Hence, it holds 

lim DoF^-^^T^ 

nTS-s>oo K+1 



The A-MAT scheme can easily be adapted to exploit the correlation between the delayed CSIT 
and the instantaneous channel state. The modified scheme, denoted as the Kq,-MAT scheme, will 
then be shown in Section |V] to achieve the following DoF. 

Theorem 3. In the K-user MISO BC with perfect delayed CSIT and current CSIT with quality 
exponent a, the DoF achieved with the Kq,-MAT scheme is equal to 

j3QpK.-MAT = (1 _ c,) DoF^™ +a DoF"^ (12) 
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with DoF^^ = K. 

The DoF achieved with ZF for the CSIT quality exponent a is well known to be equal to the 
second term of (fT2l) [3J. Hence, the K^-MAT scheme outperforms ZF and appears as a robust 
ZF scheme with respect to delay in the CSIT. The first term of (fT2l) is the DoF improvement. 

IV. The A-MAT Scheme 

Similarly to the MAT scheme, the A-MAT scheme does not exploit the correlation in time 
and hence treats the estimate as completely "stale". Although suboptimal, the A-MAT scheme 
can be easily adapted to exploit the time-correlation and henceforth will be a key component to 
develop a scheme which outperforms both MAT and ZF when a > 0. Similarly to flU, a DoF 
strictly larger than one will be achieved by exploiting the broadcast nature of the channel. This 
means that a message destined to j users (called order-j messages) will be overheard by another 
K — j users, hence providing side information which can be exploited. As a consequence, we 
will also define DoFj as the DoF with which order-j messages are transmitted. Note that with 
this notation, our objective is to transmit order- 1 messages and to maximize DoFi. 

When no confusion is possible, we omit to mention the dependency of the channels as a 
function of the time t. 

A. Example of the A-MAT Scheme for K = 3 

The A-MAT scheme consists of one initialization step, followed by a number of "main 
iteration" steps and is ended by a termination step. 

• Step 1-Initialization- This step consists of 3 time slots and takes as input 4 order- 1 symbols 
for every user. During the first time slot, the vector Ui E C^^^ containing 2 data symbols 
for RX 1 and the vector U2 E C^^^ containing 2 data symbols for RX 2 are transmitted. 
The received signal at RX i can then be written as 

Vi = hfui + hfu2 + z,. (13) 

Following the same philosophy as the alternative form of the MAT scheme |l9l, the interfer- 
ences hfu2 and h^ui are transmitted to both RX 1 and RX 2. Indeed, these equations are 
needed at both RXs because they represent, for one of them, the received interference, and 
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for the other, a second independent observation of the desired signal. Hence, the transmission 
of the 4 order- 1 data symbols has been replaced by the transmission of 2 order- 2 data 
symbols. During the second (resp. the third) time slot, the same transmission scheme is 
used to transmit to RX 2 and RX 3 (resp. RX 3 and RX 1). 

• From step 2 to step n + l-Main iteration step- We assume that 6 order-2 data symbols need 
to be transmitted to every user from the previous step. This phase is spread over 6 time 
slots and takes as input 3 order- 1 messages for each user as well as the 6 order-2 messages 
from the previous step. 

In the first time slot, 3 order- 1 messages are transmitted to RX 1 while 2 order-2 messages 
are transmitted to RX 2 and RX 3. We define the vector ui E C^^^ containing the 3 order- 1 
messages and the vector 1*23 G C^^^ containing the two order-2 messages. The received 
signal at RX i reads then as 

Vi = hfui + hfu23 + Zi. (14) 

Let the interference hfu23 be transmitted to all the RXs, the interference hfui be trans- 
mitted to RX 1 and RX 2 and the interference hfui to RX 1 and RX 3. Each RX can 
then decode its desired data symbols. Indeed, each RX could then remove the interference 
received as well as receive the right number of additional independent equations to decode 
its desired messages. Thus, hfu23 can be seen as an order-3 message while h^ui and h^ui 
are order-2 messages. The transmission of the input data symbols has been replaced by the 
transmission of two order-2 messages and one order-3 message. During the two following 
time slots, the same transmission occurs after having permuted circularly the role of the 
RXs. 

Finally, the three order-3 data symbols are broadcasted, which requires 3 time slots. In total, 
6 order-2 data symbols have been transmitted and 9 order-1 data symbols. At the same time, 
6 order-2 messages have been generated (from the overheard interference) and have to be 
transmitted in the following step. 

• Step n + 2-Termination- At the beginning of this phase, 6 order-2 data symbols have to be 
transmitted. This is carried out by simple broadcasting, and hence requires 6 time slots. 

In total, 12 + 9n order-1 data symbols have been transmitted in 6 + 6?t, + 6 time slots. After 
simplifications, the DoF given in Theorem |2] is then obtained. As the number of main iteration 
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Fig. 1. Symbolic representation of the A-MAT scheme for K = 3 users. 



Steps n increases, the DoF converges to 3/2. 

The mains steps of the A-MAT scheme for = 3 are illustrated in Fig. \T\ A particularity of 
A-MAT is that symbols of different orders are sent at the same time. 

Note that the number of order- 2 symbols transmitted is exactly equal to the number of order- 
2 messages created. This represents a particular case and for K > 3, it will be necessary to 
consider several transmissions of symbols of different orders so as to reach an equilibrium where 
the number of data symbols of order- j with j >2 taken as input equals the number of symbols 
of order j. 
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B. Description of the A- MAT Scheme 

We will now describe the A-MAT scheme for arbitrary values of K. The A-MAT algorithm 
can be divided in distinct phases which we denote as order- j phase. We will start by presenting 
the order-j phase before moving to the description of how such phases are combined in the 
A-MAT scheme. 

Note that each step should be carried out K times for the K circular permutations of the users. 
This is necessary to ensure that every user is transmitted the same number of data symbols. For 

clarity, we will present the scheme for one particular RX configuration only. 

1) Order- j Phase: The order-j phase consists in the simultaneous transmission of messages 
of order- j and of messages of order- — j). We assume wlog that the order- j messages are 
destined to RX 1, RX 2, . . ., RX j, while the order- (X—j) messages are destined to the remaining 
K — j users. We will discuss later on how these messages of order-j and order- — j) are 
obtained. In one time slot, the vector uj e c(^~-'+^^^^ containing the K — j -\- 1 data symbols 
of order-j and the vector uk-j e C^-'^^^^^ containing the j -\- 1 data symbols of order-(i^r — j) 
are transmitted. 

Hence, the received signal at RX i can be written as 



For i = 1, . . . , j, hfux-j represents an interfering signal which is desired at RX i in order 
to remove the interference. Yet, this is also of interest to RX k for k = j + 1, . . . , K since it 
represents an additional equation in uk-j- Thus, hfux-j can be seen as an order-(ii' — j 
message. 

Similarly, for i — j -\- 1, . . . , K, hfuj represents an interfering signal at RX i but is also 
of interest to RX k for k — 1, . . . , j. The messages hfuj for i — j -\- 1, . . . , K are then of 
order- (j -|- 1). 

If the j order-(_ft'— j' + l) messages and the K —j order-(j + l) messages are transmitted to the 
RXs who desire these messages, each RX can be seen to have enough interference-free equations 
to decode its messages. Indeed, the first j (resp. last K — j) RXs have received K — j (resp. 
j -\- 1) independent equations, which is exactly equal to the number of independent data symbols 
that they need to decode. The number of time slots uts required for this is then equal to 



yi = hi Uj + hi UK-j + Zi. 



(15) 



riTs = 




+ 1 



(16) 
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where the addition of a 1 corresponds to the one time slot used for the transmission in (fTSl ). 
During the rixs time slots, K — j + 1 order-j messages and j + 1 order- (i^ — j) messages can 
then be successfully transmitted. From the definition of the DoF, we can then also write rixs as 

Putting together ([l7]) and ([HI) yields 

7 + 1 K - j + 1 K-j j 

+ ^ \7 = 7^^-77 + :--T^ + 1. (18) 



BoF K-jiK,K) DoFj DoFj+i{K,K) DoFx-j+i 
2) The A-MAT Scheme: The order-j phase assumes that messages of order-j and messages 

of order- (if — j) need to be transmitted. We will now show how the order-j phase are combined 

in the A-MAT scheme to allow for the transmission of order- 1 data symbols. 

The proof that the A-MAT scheme successfully transmit the data symbols and the derivation 

of the DoF will be done in the following subsection. We present the A-MAT for the case K 

odd and the modifications required when K is even will be described hereafter. 

• Step 1-Initialization- The order-j phase is carried out for j = 1, . . . , {K — l)/2 but for 
every phase, the messages of higher order are replaced by the order- 1 symbols that we aim 
at transmitting. This is done by choosing arbitrarily any RX among the j destined RXs 
since the messages are transmitted so as to be decoded at each of the j RXs. This step 
is spread over {K — l)/2 time slots and leads to the creation of messages of order j for 
j = 2, . . . ,K. The number of messages of order-j generated can be obtained from ([T9] ). 
One message of order- is generated and is directly transmitted via broadcasting. 

Note that for clarity a different initialization has been used for = 3 in Subsectior lIV-A[ 

• Step 2 to step (n + \)-Main Iteration- For every iteration step, all the order-j phases are 
carried out once for j = 1, . . . , — l)/2. At the nth step, the order-j data symbols being 
sent are the ones which have been generated during step {n — 1), where the initialization 
corresponds to step 0. The verification that the number of data symbols created matches 
the number of data symbols needed as inputs will be done in the next subsection. 

• Step n + 2-Termination- All the data symbols which need to be transmitted are simply 
broadcasted. This phase can be seen after summation of all the equations given by (fTTI) to 
require K{K + I) /2 - I - {K - I) time slots. 

If K is even {K — l)/2 is replaced by K/2 — 1 and the order- _ft'/2 phase is carried out only one 
time every two steps. The number of time slots used for the termination remains unchanged. 



13 



C. Sum DoF Achieved 

We will now show that this scheme can indeed be used to achieve the DoF given in Theorem [2l 
We start by proving the following lemma. 

Lemma 1. For every j ^ l,K, the number of data symbols taken as input in one A- MAT 
iteration is equal to the number of order j messages generated in such an iteration. 

Proof: A detailed proof is provided in Appendix El ■ 
Using Lemma [H we can compute the DoF achieved by the A-MAT scheme by observing 
how many time slots are used and how many order- 1 data symbols could be transmitted during 
those time slots. Let us consider for the moment K to be odd. 

• -Initialization- The initialization step is spread over {K+l)/2 time slots and K{K+1) /2—1 
order- 1 data symbols are taken as input. 

• -Main iteration step- At every time iteration, K order- 1 data symbols are taken as input and 
each iteration is spread over {K + l)/2 time slots. According to Lemma [H the number of 
order-j symbols created in every iteration with j > 2, is the same as the number of order-j 
messages transmitted. Thus, the DoF of one iteration step is K / ({K + 1) / 2) = 2K/ {K+ 1). 

• -Termination- The termination step requires K{K + l)/2 — K time slots to broadcast all 
the remaining data symbols. 

To compute the DoF achieved, it is necessary to take into account the need to consider for 
every steps the K circular permutations between the users. Hence, the total number of time slots 
over which the A-MAT scheme is spread is equal to 

. {^l.n!^±l^ A-(A- + l)-2(A--l) ^ ^^^^ 

where the first term in the RHS of (fTTT ) corresponds to the initialization, the second term to the 
n main iteration steps, and the third one to the termination step. 
In total, the DoF achieved by the A-MAT after n steps is then 

j3^pA-MAT.^ ir) = , ^ , , ^ , (20) 

(^) + n (^) + (^(^±i)^^) 

which gives after some basic manipulations the expression in Theorem |2l 

As the number of time slots increases, the A-MAT scheme achieves a DoF of 2K/{K + 1) 
based on completely outdated CSIT. Although the sum DoF of this new scheme is smaller than 
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Fig. 2. Sum DoF in terms of the number of users K. 



the one achieved with MAT, it provides an alternative way to exploit delayed CSIT which will 
make the exploitation of the prediction obtained from the delayed CSIT more applicable. The 
A- MAT scheme is compared to the MAT scheme in Fig. [2l 

V. The K„-MAT Scheme 

When the CSIT is completely outdated (a = 0), we will use our new A-MAT scheme in 
place of the MAT scheme. In the other extreme, when a = 1, ZF is well known to be DoF 
achieving. Thus, it remains to develop a scheme for the intermediate values of the CSIT quality 
exponent a. Extending the A-MAT scheme to this case will in fact prove to be very easy: The 
DoF achieved with the modified scheme, which we denote as the Kq,-MAT scheme, will go 
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linearly from the DoF achieved with the A-MAT scheme to the DoF achieved with ZF as the 
CSIT quality exponent a increases. 

Note that the sum DoF obtained with the outer bound given in Theorem \T\ for a CSIT quality 
exponent a is equal to (1 - «) DoF'^^'^ +a DoF^^ where DoF'^'^'^ is the DoF achieved with 
MAT alignement. Hence, if A-MAT were optimal for a = 0, K^-MAT would then be optimal 
for arbitrary values of a. This it the case for K = 2 where A-MAT coincides with the alternative 
version of MAT. As a consequence, the Kq,-MAT scheme is also optimal. In fact, the K^-MAT 
scheme matches then with the optimal scheme from [|20l . [[211 . 

We will start by describing the different steps of the Kq,-MAT scheme before moving to the 
analysis of the DoF achieved. 



A. Description of the Kq,-MAT Scheme 

We will show how the order-j phase of the A-MAT scheme is modified to exploit the 
correlation between the delayed CSIT and the instantaneous channel. The full K^-MAT scheme 
follows then trivially from the description of the A-MAT scheme in Section |IVl 

We assume wlog that the order- j symbols are destined to the first j TXs and the order- (fC—j) 
symbols to the K — j last RXs. 
• Direct Transmission: 
a) The A-MAT Data Symbols: According to the A-MAT scheme, the TX transmit K-j + 1 
order-j messages and j + 1 order-(_ft' — j) messages. Yet, the data symbols are this time 
precoded. The ith order-j data symbol is precoded to form the vector a.^"* G C*^^^ while 
the A;th order- (K — j) data symbol is precoded as the vector a!'^^^^ G C*"^^^. The vector 
a^'^ is chosen to ZF the interference to the K — j last RXs, i.e., such that 

Wk = j + 1,...,K, h'^a?=0. (21) 

The remaining K — j precoded data symbols are chosen such that \/k < i, (a^"'^)^ap'' = cQ. 
Similarly, a[^~^^ is chosen such that 

Vfc = l,...,j, h^a[''-'^=0 (22) 

'Note that this is solely done to ensure that all the precoded data symbols are linearly independent and span a subspace of 
dimension K — j + 1. 



16 



and the remaining j beamformers such that \/k < i, (a), ) o-l 
The power is allocated to these precoded data symbols as follows. 



0. 



A; = 1, 



a 



(i)ii2 



K-j + l 



(J)||2 



and similarly 




a 



\a 



(^-i)ll2 



Up- p") ■ 

pi— a 



2 

1 1 

2K-j+V 



1 K-j pl-a 

2 K~j+1 



(23) 



1 ( P — poi\ _ l_2_ pi 

2 V-* / 2i+l-' 



a 



(^-i)ii2 



(24) 



1 1 pl-a 

2 j+1 



The reason for this particular power allocation will become clear in the decoding part of 
the scheme. Every data symbol is sent with the rate (1 — a) log(P). 
b) The ZF Data Symbols: In addition to these data symbols, we will transmit at the same 
time via conventional ZF one data symbol sj to RX j (i.e an order- 1 data symbol) for every 
RX j. Hence, the data symbol sj is precoded to obtain Uj E C*^^^ such that 

\fkj^j,h'^u,=0. (25) 

The power is allocated to verify that Vz, £[1111411^] = P^ / K and each data symbol is sent 

with the rate alog(P). 

The received signal at RX k then reads as 



K-j+l 



i+1 



K 



k<j, 



Vk 



Zk 



i=2 



i=l 



i=l 



i+1 



K-j+l 



(26) 



i=2 



i=l 



i=l 



^pl — a 



(?) (K~j) 

Note that the interferences from and a\ have been attenuated by P^" following 
the ZF with respect to the imperfect channel estimates. 

Creation of the A-MAT Order- j + 1 Data Symbols: Considering the received signal scaling 
in P" as noise and omitting the power scaling of the received signals, we have obtained 
the same received signals as in the A-MAT scheme described in Section |IVl Hence, the 
interference h^af^^-^^ for k < j is needed to remove the interference at RX k but forms 
also a desired equation for the last K — j users. Thus, it can be seen as an order-(iC — j + 1) 
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message. Similarly, the interference J2i ^^^i A; > j + 1 is needed by the first j RXs 
and by RX k, and is hence an order- (j + 1) message. 

All the "equations" which have to be retransmitted have a power scaling in P^^". Hence, 
we can use the well known result that quantizing them with (1 — a) log(P) bits leads to a 
distorsion scaling in P° [|261 . which is negligible in terms of DoF. 

The data symbols of order-j and order-(K — j) taken as input have a rate of (1 — a) log(P) 
and this is also the case of the new messages created. As a consequence, the A-MAT 
algorithm can proceed with the transmission of the quantized equations as the order- (j + 1) 
and order- (i^ — j + 1) messages for the next iteration of the A-MAT scheme. 

• Successive decoding: We now consider that the modified A-MAT has reached its end. Let 
us first consider RX k for k < j. This RX has received K — j equations relative to its 
order-j symbols and was also able to remove the interference received. Hence, it has in 
total K — j + 1 equations having each a SNR scaling in P^~". Consequently, RX k can 
decode all the desired precoded data symbols a[-'^ for all i. 

• Successive decoding: We now consider that the modified A-MAT has reached its end. Let 
us first consider RX k for k < j. This RX has received K — j equations relative to its 
order-j symbols and was also able to remove the interference received. Hence, it has in 
total K — j + 1 equations having each a SNR scaling in P^^". Consequently, RX k can 
decode all the desired precoded data symbols a[''^ for all i. 

The data symbols of order-j being decoded, they can be subtracted from the received signal. 
Since the interference have also been subtracted, the received signal at RX k reads then as 



The interference term in (|27T ) is drawn in the noise due to the attenuation by P~" from the 
ZF precoding. As a consequence, the precoded symbol is received at RX k with a SNR 
scaling as P" and can be decoded. 

The same analysis can be carried out for RX k with k > j. 



K 




(27) 
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Fig. 3. Sum DoF for K = 5 users in terms of the CSIT quality exponent a. 



B. Degrees of Freedom Analysis 

From the description of the algorithm, the DoF expression from Theorem |3] is easily derived 
as follows. The A- MAT scheme has been used to transmit data symbol of rate (1 — a) log(P) 
while at every time slot of this scheme, one data symbol has been transmitted to every user via 
ZF with a rate equal to Q;log(P). Hence, the DoF given in Theorem [3] can be achieved. 

In Fig. [3l we represent the sum DoF achieved with the K^-MAT scheme. Although the MAT 
scheme is optimal when a = and the CSIT is completely outdated, the A-MAT scheme 
becomes more efficient as the CSIT quality exponent increases. The Kq,-MAT scheme coincides 
with ZF when the CSIT is accurate enough (a = 1) and is otherwise more performing. Hence, 
it can be seen as a robust version of ZF with respect to the delay in the CSIT. 
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Furthermore, we show in Fig. |4]the DoF achieved in terms of the number of users with the 
CSIT quality exponent a = 0.5. It can be seen that the Kq,-MAT scheme outperforms in that 
case both ZF and MAT. 
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Fig. 4. Sum DoF in terms of tfie number of users K for the CSIT quality exponent a — 0.5. 
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VI. Proof of the Outer Bound 

To obtain the outer bound, we adopt a genie-aided upper bounding technique inspired from 
[fTOl . [|20l . We provide to RX i the side information of the RX j's message Wj as well as 
the received signal yj(t'),W < t for j = i + 1, ■ ■ ■ , K. We consider that all the K users are 
active (i.e., have a positive DoF) because the approach trivially extends by replacing K with 
any number p of active users such that I < p < K . Recall that all the RXs have access after a 
given delay to the perfect CSI H(t) as well as the imperfect CSI ii{t). Since the decoding of 
the signal received at time t is done solely once the RX has received the CSI relative to time t, 
it means that we can consider that the RXs have access to the CSI instantaneously. We further 
define for ease of notation W[i:j] = {Wi, Wi+i, ■ ■ ■ , Wj}, Y[i.,j]{t) = {yi{t),yi+i{t), ■ ■ ■ ,yj{t)}, 
H[i:i](t) = [hi{t), hi+i{t), ■■■ , hj{t)]^, where j > i, and Y^...^ = {Y[i;j](m)}^=i. 

From Fano's inequality, it follows for arbitrary e„ > 0, 

n{Rk-en) < I{Wk;W[k-,i:K],Y^k:K]\ii\^l (28) 
= I{W,; Yl^^\W^,+,.,K], H", H") (29) 

n 

= Y[,.x](t)|W^[fc+i:X], Yf-],],H^H'^) (30) 

t=i 

n 

= J2 (/i(Y[,.x](t)|W^[.+i:X], Y[-^j,H*,H*) - /.(Y[,.^](t)|iy[fc.^], Y[-^j,H*,H*)) 

t=i 

(31) 

n 

= {h{Y[k:K]{t)mt),H{t)) - h{Y[k:KmwkM{t),m))) o^) 

t=l 

where we have defined Uk{t) = Y[^L, H*^^, H*}. Thus, the weighted sum rate can 
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be bounded for arbitrary nonzero natural number N^^k — 1, K as 



K 

E 



n{Rk - En) 



k=l 

n K ^ n K 



t=l k=l ^ t=l k=l ^ 

n K-l . 



t=l k=l 



n K-l . 

^ E E Y[fe:K] W I ) , H (i) ) - Y[.+l:i.l W I W^.+l , Wfe+i (i) , ^ 

t=l k=l ^ ^ 

+ nlogP + n-0(l) (35) 

= EE {^KY^,.,Kit)pik{t),m)) - -^h{Y^,^,.,Kit)mt),m)) 



+ ^My/^WI^^/^W,H(i))-— MY[i:^l(i)|W^i,Wi(i),H(i)) (34) 

iVft- iVi 



t=l k=l 



n K-l 

+ nlogP + n-0(l) (36) 
Let us focus on one of the differences of entropy in the summation. We can apply the same 
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calculation as in the proof of the outerbound in [20|. Firstly, v/e. set\/k, Nk = K — k + 1 to write 

f hiY[k:K]it)mt),Uit)) /^(Y[,+l^,,](t)|^Y,(t),H(t)) ^^ 

< inaX ^ -7 ; ; ■ — ; (37) 

p(Wfc(t)),p(x(i)|Wfc(t)) V K -k + 1 K -k J 

^ f hiY^,.,Kimk{t),u{t)) hiY[,+,..Kit)mt),H{t)) \ 

S W-diX tj?/, (t) max — ; — ; (Jo) 

Wfe(t)) ' p{Mt)\Ut:{t)) \ K-k + 1 K-k J 

~ ^^I^t^M max,... ^mt)mt) K-k + i 

(39) 



u,{t)) pi^itmit)) ^''>^-''''\ K-k + l K-k 

'h{li[k:K]{tMt) + Z[k:K]{t)\Uk{t)) 



f] 

maxE/y, m max - 



K-k + l 

/t(H[fc+i:j^](t)x(t) + Zlk+l:K]{t)\Uk{t)) 

K-k 

h{liyk-.K]{t)^{t) + Z^k-.K]{t)\Uk{t)) 



(40) 



max li/7y, (J) max max i^T^(t\\i^(t\ 77 ; t 

U,(t)) "'^^^^ CbO p{^{t)\U,{t)) H(t)|H(t) \^ K-k + l 

tr(C)<Pcov(x(t)|Wfc(t))dC 

/l(H[fe+i;K](t)x(t) + 2;[fc+i:X](t)|Wfe(t)) 



(41) 



where (|39| ) is obtained because maximizing inside the expectation leads to an upper bound and 
(|4TI) follows from splitting the constraint on the distribution in two constraints. 

We can now apply the Extremal Inequality from [l27l, Theorem 8]. This is possible because 
x(t) is independent of H(t) (and of the noise) conditioned on the channel estimate H(t). The 
multiplication by the channel matrices (not present in the original theorem) is taking care of by 
inverting the channel after having regularized it, and letting then the regularization tend to zero 



It follows from that result that the optimal vector x(t) is Gaussian distributed. We define then 
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the covariance matrix Kx(t) = E{x(t)x^(t)|Wfc(t)} and write 



< 

Uk(t)) CbO Kx(t) 

tr(C)<P 



maxEw,(t) max ^max^ EH(t)|H(t) ^ _ ^ ^ ^ log det (I /^-fc+i + H[fc:;^] (t) (t ) Hg^^j (t) ) 

1 



K -k 



logdet(Ii,_fc + Hik+i:K]{t)K^{t)iif,^i..K]{t)) (42) 



maxEi^,(i) max EH(t)|Hft) ( ^_'^^^^ logdet(Ij^_fc+i+H[fc;;^](t)K*(t)Hg^;^](t)) 

* tr(C)<P 



^ logdet(I^_fc + H^k+i:K]{t)K*{t)Hf,^,.j,^{t)) ) (43) 



Wfc(i)) 

tr(C)<P 



1 



<maxEw,(i) max EH(t) |H(t) ( ^ _ ^ ^ ^ log det + Hj^^;^] (t) C (t) Hg^^^j (t) ) 



- logdet(I^_, + H[,+i.^](t)C(t)H«+,^^.](t))^ (44) 

< ^_fc + l «lQg-P + Qa) (45) 



where we have defined K* as the covariance matrix solution of the inner maximization in (1421) . 
Inequality a is a consequence of the following lemma which is proven in Appendix |B} 

Lemma 2. Let us consider two x M (k = 1,2) random matrices = + H,t. where 
Hfc has its entries distributed as i.i.d. A/c(0, o"^) and independent of ilk- Given any K ^ with 
eigenvalues Ai > • ■ ■ > Am > 0, and M > Ni > N2, if cr^ tends to zero, then 

i-Ejj, logdet(I;v, +HiKHf ) - i-E^, logdet(I^, +H2KHH) < \og{a^) + 0(1). 

(46) 



Using dM]) in ^ with Nk = K -k + l,ii follows that 

fc=l t=l fc=l 
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Dividing by n log(P), considering arbitrarily long codewords, and letting P tend to infinity gives 



k=l k=l 

K 



= 1 + (49) 



k 

k=2 

By permutation of the users and variation of the number of active users, all the outer bounds 
can be obtained. This concludes the proof. 
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VII. Conclusion 

In this work, considering a X-user MISO BC, a new transmission scheme has been developed 
to exploit at the same time the principle behind the MAT alignment based on delayed CSIT 
and ZF of the interference. The novel K^-MAT scheme is more robust than ZF to the channel 
estimates being received with some delay and coincides with ZF when the CSIT received is 
accurate enough. Furthermore, over a wide range of values taken by the CSIT quality exponent a, 
the Kq-MAT scheme outperforms both MAT and ZF. This makes such approach a strong 
candidate to improve the robusteness to CSI feedback delays of the transmission scheme. In 
addition, an outer-bound DoF region has been derived. How to reduce the gap between the outer 
and the inner bound is an interesting open problem for futur research. Furthermore, the MAT 
alignment scheme from Maddah-Ali and Tse is very recent and is expected to have applications 
in many more settings and to have a strong potential for further improvements. 
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Appendix A 
Proof of Lemma [H 

Proof: Let us recall first for the sake of clarity the DoF expression for the order-j phase 

J + 1 K-j + l K-j , J 



BoFk-j DoFj DoFj+1 DoFa'-j+i 
Rewriting this expression for the order-j + 1 phase gives 



L (50) 



J + 2 K-j K-j-1 j + 1 

+ ——^ = ——^ ^ W-F ^1- (51) 



DoFK-j-i DoFj+i DoFj+2 BoFk-j 
and for the order-j — 1 phase 



j K -j + 2 K -j + 1 j - 1 

+ ^ ^ = ^ \^ + + 1- (52) 



DoFk-,+1 DoF,_i DoF, DoFa-j+2 

Adding dSO]) and dlB, the first term of the Left-Hand Side (LHS) of dSO]) simplifies with 

the second term of the right-hand side (RHS) in (ISH) while the first term of the RHS of (|50l) 

simplifies with the second term of the LHS of (|5TI) . Similarly, adding (l50l) and (|52|) . leads to 

the simplification of the second term of the LHS and the second term of the RHS in (l50l) with 

their counterpart in (l52l . 

As a consequence, adding the equations obtained from phase 1 to phase k yields 

K k+1 _ K - k 1 
DoFi ^ DoF^-fc ~ DoFfc+i ^ DoF^ ' ^ 
We now differentiate between the two cases K even and K odd. 

• If K is odd, then choosing k = {K — l)/2 in (l53l) gives 

' +^^. (54) 



DoFi DoFa 2 

because it holds in that case that K — k = k + 1 such that two terms simplify in (|53T ). The 
proof concludes by using that DoF k{K, K) = 1. 
• If iiT is even, writing (l53l) with /c = K/2 — 1 gives 

+ = + ^^-T^ + IT - 1- (55) 



DoFi DoFk ,1 DoFk DoFa- 2 

2 ^ 2 

We proceed by writing the DoF expression (|50] ) for the order- 7^/2 phase which gives 

K + 2 K 

+ 1. (56) 



DoFk DoFx ,1 

2 2 """-^ 

Adding one half of ([561) to ([55]) gives (|54l ). 
The result follows directly from (l54T i since the expression relative to the symbol of order-j for 
j 7^ 1, K have been simplified. ■ 
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Appendix B 
Proof of Lemma [2] 

We will proceed by bounding first separately each term of (l46T l. 

• Let us consider first the second term which we should lower bound. Recall that we consider 
two NkX M (k = 1, 2) random matrices = Hfo+H^, where has its entries distributed 
as i.i.d. A/c(0,cr^) and independent of Hfe and a matrix K ^ of size M x M with 
eigenvalues Ai > ■ ■ ■ > \m > such that M > Ni > N2. We also define the Eigenvalue 
Decomposition (EVD) of the positive semi-definite matrix K such that K = VAV^ with V 
a unitary matrix of size M x M and A = diag(Ai, A2, . . . , Xr) such that Ai > A2 > . . . , > 
Xk- We then write 

EH,logdet(I^,+H2KH^) 

= logdet(I^, +H2KH") + Eg, logdet(IiV2 +H2Hf) - Eh, logdet(I^, +H2H«) 

(57) 

> Eh, logdet(I^, +H2H« + (I^, +H2H«)5H2KH« ((I^v, +H2H^)^)'') 

-iV2EH,logdet(H-||H2||^) (58) 

> Eh, logdet(I^, +H2(lAf +K)H^) - iV2logdet(l + IIH2III + MN2a^) (59) 

= Eh, logdet(I^, +H2(Im +K)H«) + 0(1) (60) 

where (|591 ) has been obtained by applying Jensen's inequality. We define A' = diag(Ai, A2, . . . , 
as the matrix containing the A^^i largest eigenvalues from A and we proceed from (|60] ) as 

Eh, logdet(I;v.+H2KH«) 

> Eh, logdet(I;v. +H2V(Im +A)V«H«) + 0(1) (61) 

> Eh, log det(Ijv, +H2V'(I^, +A') V'^H") + 0(1) (62) 
= Eh, logdet(I^, +^'{In, +A')$'^) + 0(1) (63) 

> ^ log det(I^, +A') + \og{a') + 0(1) (64) 

where we have defined = H2V' G C^^^^^ with V containing the A^^i largest eingen- 
vectors, i.e., such that 

K = V'A'( V')" + ( V - V) (A - A') (V - V')". (65) 
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Inequality a follows from the fact that det(I + X) > det(I + Y) if X ^ Y. Inequality h 
is verified because the Gaussian distribution remains invariant by multiplication with a 
deterministic rotation. Hence, can be written as ^'+$' with the elements of $ distributed 
as the elements of H2. 

As a consequence, the following lemma presented in [24] (although in a different form) 
can be applied to obtain inequality h. 

Lemma 3. Given a random matrix H = H + H G C"^'" {n < m < 2n), where H is 
independent oftl and has its entries distributed as i.i.d. A/c(0,cr^), and any K ^ with 
eigenvalues A = diag([Ai, A2, . . . , Am]), with Ai > A2 > ■ • ■ > Am > 0, «Y holds that 

E^logdet(I„+HKH«) > -logdet(A) + ^ -\og{a^) + 0(1). (66) 

m m 

• We now tum to deriving an upper bound for the first term in (146) . 
1 1 

—Eh, logdet(I^, +HiKHf) < —Eh, ^^^(l + l|Hi||^A,) (67) 
^ ^ i=i 

1 

^ ^ E + (II^iIIf + MN,a^)\) + 0(1) (68) 

max(||Hi||| + MiVia2),l) A,) + 0(1). 
i=i ^ ^ 

(69) 

From the upper bound (1691 ) and the lower bound (|64T i. we can then write 
-^Eh, logdet(I;v, +HiKH«) - -^Eh, logdet(I^, +H2KH«) 

iVi iV2 

^ i^E(Ml+[max(l|Hil|^ + MiV,a2),l)] A.)-log(l + A.)) - ^^^J^ log(a2) + 0(l) 

^=l 

(70) 



iVi-iV2, ^ 2 



log(a^) + 0(l) (71) 

where (TtTI) is obtained by observing that the sum of difference of logarithms in (TTOI ) remains 
bounded for any values taken by the A^. 
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